Range-constrained Phase Reconstruction for Recovering Time-domain Signal from Quantized Amplitude and Phase Spectrogram
نویسندگان
چکیده
This paper describes a novel algorithm for recovering time-domain signal from quantized amplitude and phase spectrogram, which is applicable for spectrogram-based audio coding. In order to obtain a better quality sound, a phase reconstruction technique is first applied with constraint for keeping phase in each time-frequency bin within each quantization range, and then, time-domain signal is recovered by the standard inverse short-time Fourier transform. Experimental evaluation based on the objective PEAQ measure shows that the proposed range-constrained phase reconstruction is effective for improving the sound quality.
منابع مشابه
Beyond NMF: Time-Domain Audio Source Separation without Phase Reconstruction
This paper presents a new fundamental technique for source separation of single-channel audio signals. Although nonnegative matrix factorization (NMF) has recently become very popular for music source separation, it deals only with the amplitude or power of the spectrogram of a given mixture signal and completely discards the phase. The component spectrograms are typically estimated using a Wie...
متن کاملSpeech Reconstruction from Binary Masked Spectrograms Using Vector Quantized Speaker Models
Several source separation techniques use binary masking on spectrograms to separate two or more speakers from each other. In this thesis, the possibilities for obtaining the best quality signal, reconstructed from masked spectrograms through vector quantized models of speakers, is investigated. The advantages and disadvantages of such an approach are examined. Additionally, the task of signal r...
متن کاملPhase initialization schemes for faster spectrogram-consistency- based signal reconstruction
In previous contributions [1, 2], we presented a fast algorithm for the reconstruction of a time-domain signal from a magnitude short-time Fourier transform (STFT) spectrogram, and showed experimentally that it could lead to substantial reduction in computation time compared to previous work by Griffin and Lim [3] when both algorithms are initialized using zero phase. We study here other strate...
متن کاملPhase-Encoded Speech Spectrograms
Spectrograms of speech and audio signals are time-frequency densities, and by construction, they are non-negative and do not have phase associated with them. Under certain conditions on the amount of overlap between consecutive frames and frequency sampling, it is possible to reconstruct the signal from the spectrogram. Deviating from this requirement, we develop a new technique to incorporate ...
متن کاملExplicit consistency constraints for STFT spectrograms and their application to phase reconstruction
As many acoustic signal processing methods, for example for source separation or noise canceling, operate in the magnitude spectrogram domain, the problem of reconstructing a perceptually good sounding signal from a modified magnitude spectrogram, and more generally to understand what makes a spectrogram consistent, is very important. In this article, we derive the constraints which a set of co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012